DNA encoding a constitutive activator retinoic acid response (CAR) receptor

ABSTRACT

Purified DNA encoding a Constitutive Activator of Retinoid acid response (CAR) receptors and the recombinant proteins expressed from such DNA are disclosed. The recombinant receptor polypeptides are members of the nuclear hormone receptor super family and are used to identify CAR ligands and CAR receptor binding sites and are also used to produce therapeutics. Antibodies specific for CAR receptor polypeptides are also disclosed.

This is a divisional of copending application Ser. No. 07/843,350, filed Feb. 26, 1992.

BACKGROUND OF THE INVENTION

This invention relates to receptors, particularly nuclear hormone receptors.

In higher organisms, the nuclear hormone receptor superfamily includes approximately a dozen distinct genes that encode zinc finger transcription factors, each of which is specifically activated by binding a ligand such as a steroid, thyroid hormone (T3) or retinoic acid (RA). However, there is an additional, somewhat larger group of cDNAs that encode proteins that do not bind or respond to any known ligand. These members of the superfamily are called orphan receptors. While the role of the better characterized conventional receptors in regulating important processes in developing and adult individuals is becoming clearer, the function of the orphan receptors has been uncertain.

A number of the conventional and orphan members of the superfamily share identical or very similar amino acid sequences in an important region of the first zinc finger. Both genetic analyses and X-ray crystallography indicate that this region, termed the P box, makes sequence specific contacts with the DNA. The conventional receptors in this P box-defined subgroup include those that bind estrogen, vitamin D, T3 and RA, and nearly all of the orphan receptors identified to date also fall into this class. As a consequence of this overlap in binding specificity, many hormone response elements can bind more than one type of receptor. The best characterized of these is the element upstream of the rat growth hormone gene, which can be activated by three different isoforms of the T3 receptor encoded by two different genes and by an unknown number of retinoic acid receptor isoforms encoded by three different genes. While it does not appear to respond to the estrogen receptor or the vitamin D receptor, its response to other members of the subgroup remains uncertain.

Recently the potential complexity of the interactions of the conventional receptors with their response elements has been substantially increased by the demonstration that the three closely related RXR proteins can form heterodimers with the thyroid hormone, retinoic acid and vitamin D receptors. These heterodimers show higher binding affinity for appropriate response elements, and the RXRs are hypothesized to play central roles in signal transduction by all three classes of receptors. The impact on such heterodimers of the binding of the retinoid metabolite 9-cis retinoic acid by the RXRs remains unclear.

SUMMARY OF THE INVENTION

In general, the invention features substantially pure CAR receptor polypeptide. Preferably, such a receptor polypeptide includes an amino acid sequence substantially identical to the amino acid sequence shown in FIG. 1 (SEQ ID NO:1).

The invention further features a substantially pure polypeptide which includes a CAR receptor DNA binding domain and a CAR receptor gene activation domain. Preferably, the DNA binding domain includes a sequence substantially identical to amino acids 11-75 of FIG. 1 (SEQ ID NO:10), or a DNA binding fragment thereof; and the gene activation domain includes a sequence substantially identical to amino acids 76-348 of FIG. 1 (SEQ ID NO:10), or a gene activating fragment thereof.

In a related aspect, the invention features a substantially pure polypeptide which includes a CAR receptor heterodimerization domain.

In preferred embodiments of various aspects, the receptor polypeptide is mammalian, and preferably, human.

In yet other aspects, the invention features substantially pure DNA which encodes a CAR receptor polypeptide of the invention. Preferably, such DNA is cDNA; and encodes a human CAR receptor polypeptide. The invention also features a vector which includes such substantially pure DNA and which is capable of directing expression of the protein encoded by the DNA in a vector-containing cell. Finally, the invention features a cell which contains the substantially pure DNA. Preferably, the cell is a eukaryotic cell, for example, a mammalian cell.

In another aspect, the invention features a method of producing a recombinant CAR receptor polypeptide (or a fragment or analog thereof). The method involves (a) providing a cell transformed with DNA encoding a CAR receptor or a fragment or analog thereof positioned for expression is the cell; (b) culturing the transformed cell under conditions for expressing the DNA; and (c) isolating the recombinant CAR receptor polypeptide.

In yet another aspect, the invention features a substantially pure antibody which specifically binds a CAR receptor polypeptide of the invention.

In yet other aspects, the invention features therapeutic compositions which include as an active ingredient a CAR receptor polypeptide of the invention formulated in a physiologically-acceptable carrier. Such therapeutic compositions may be used in methods of treating Graves' disease or cancer (for example, lung cancer) in a mammal; such methods involve administering the therapeutic composition to the mammal in a dosage effective to decrease thyroid hormone receptor function (for the treatment of Graves' disease) or in a dosage effective to increase retinoic acid receptor expression (for the treatment of cancer).

In yet other aspects, the invention features methods of identifying a CAR ligand. One method involves (a) providing a nucleic acid sequence which encodes a CAR receptor polypeptide; (b) introducing into a host cell which is functionally deficient for CAR receptor (i) the nucleic acid which encodes the CAR receptor polypeptide (preferably, a CAR receptor polypeptide including an amino acid sequence substantially identical to the amino acid sequence shown in FIG. 1; SEQ ID NO: 10) and (ii) a reporter gene operably linked to a CAR receptor polypeptide binding site (preferably, the binding site GGGTAGGGTTCACCGAAAGTTCACTCG; SEQ ID NO: 5); (c) measuring induction of the reporter gene in the transfected host cell; (d) contacting the transfected host cell with a candidate ligand; and (e) measuring induction of said reporter gene in the presence of the candidate ligand, an increase or decrease in the induction as compared to the induction in (c) being indicative of the presence of a CAR ligand.

The second method involves (a) providing a nucleic acid sequence which encodes a CAR receptor polypeptide (preferably, including an amino acid sequence substantially identical to the amino acid sequence shown in FIG. 1; SEQ ID NO: 10); (b) introducing the nucleic acid into a host cell so that the recombinant CAR receptor polypeptide is expressed; (c) isolating the recombinant protein; (d) immobilizing the recombinant protein on a solid substrate (preferably, a column); (e) contacting the immobilized recombinant protein with a candidate ligand under conditions which allow formation of an affinity complex between the immobilized recombinant CAR receptor polypeptide and the candidate ligand; and (f) detecting complex formation as an indication of the presence of a CAR ligand.

In yet other aspects, the invention features methods of identifying a CAR receptor DNA binding site. One method involves (a) providing a nucleic acid sequence which encodes a CAR receptor polypeptide (preferably including an amino acid sequence substantially identical to the amino acid sequence shown in FIG. 1; SEQ ID NO: 10); (b) introducing into a host cell which is functionally deficient for CAR receptor (i) the nucleic acid which encodes the CAR receptor polypeptide and (ii) a reporter gene which is operably linked to a candidate CAR receptor DNA binding site; and (c) measuring induction of the reporter gene in the transfected host cell, induction being indicative of the presence of an operably linked CAR receptor DNA binding site.

A second method involves (a) providing a nucleic acid sequence which encodes a CAR receptor polypeptide (preferably, including an amino acid sequence substantially identical to the amino acid sequence shown in FIG. 1; SEQ ID NO: 10); (b) introducing the nucleic acid into a host cell so that the recombinant CAR receptor polypeptide is expressed; (c) isolating the recombinant protein; (d) contacting the recombinant protein with a candidate DNA binding site under conditions which allow formation of an affinity complex between the recombinant CAR receptor polypeptide and the candidate binding site; and (e) detecting complex formation as an indication of the presence of a CAR receptor DNA binding site.

In a final aspect, the invention features chimeric receptors. Such chimeric receptors may include the DNA binding domain of a CAR receptor polypeptide (preferably, including a sequence substantially identical to amino acids 11-75 of FIG. 1; SEQ ID NO:1 or a DNA binding fragment thereof) fused to the gene activation (and, preferably, the ligand binding domain) of a heterologous protein, preferably, a nuclear hormone receptor, or a protein chosen from the group consisting of: glucocorticoid receptor, α-retinoic acid receptor, β-retinoic acid receptor, γ-retinoic acid receptor, estrogen receptor, progesterone receptor, vitamin D receptor, mineralocorticoid receptor, thyroid receptor, VP16, and GAL4; or the chimeric receptor may include the gene activation domain of a CAR receptor polypeptide (preferably, including a sequence substantially identical to amino acids 76-348 of FIG. 1; SEQ ID NO:10, or a gene activating fragment thereof) fused to the DNA binding domain of a heterologous protein, preferably, a nuclear hormone receptor or a protein chosen from the group consisting of: glucocorticoid receptor, α-retinoic acid receptor, β-retinoic acid receptor, γ-retinoic acid receptor, estrogen receptor, progesterone receptor, vitamin D receptor, mineralocorticoid receptor, thyroid receptor, and GAL4.

By "CAR receptor polypeptide" is meant a polypeptide which is capable of binding to a DNA sequence of GGGTAGGGTTCACCGAAAGTTCACTCG (SEQ ID NO: 5) and activating the expression of downstream genes, even when mammalian cells harboring the receptor are grown in medium containing charcoal-stripped serum; in particular, a CAR receptor polypeptide activates such gene expression in the absence of retinoic acid.

By "substantially pure" is meant that the CAR receptor polypeptide provided by the invention is at least 60%, by weight, free from the proteins and naturally-occurring organic molecules with which it is naturally associated. Preferably, the preparation is at least 75%, more preferably at least 90%, and most preferably at least by extraction from a natural source (e.g., a mammalian liver cell); by expression of a recombinant nucleic acid encoding a CAR receptor polypeptide, or by chemically synthesizing the protein. Purity can be measured by any appropriate method, e.g., column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. By a "polypeptide" is meant any chain of amino acids, regardless of length or post-translational modification (e.g., glycosylation)

By "substantially identical" is meant an amino acid sequence which differs only by conservative amino acid substitutions, for example, substitution of one amino acid for another of the same class (e.g., valine for glycine, arginine for lysine, etc.) or by one or more non-conservative substitutions, deletions, or insertions located at positions of the amino acid sequence which do not destroy the function of the protein or domain (assayed, e.g., as described herein). A "substantially identical" nucleic acid sequence codes for a substantially identical amino acid sequence as defined above.

By a "DNA binding domain" is meant a stretch of amino acids which is capable of directing specific polypeptide (e.g., receptor) binding to a particular DNA sequence.

By "gene activation domain" is meant a stretch of amino acids which is capable of inducing the expression of a gene to whose control region it is bound.

By "heterodimerization domain" is meant a stretch of amino acids which is capable of directing specific complex formation with a heterologous protein; such a domain may direct the formation of dimers, trimers, tetramers, or other higher order hetero-oligomers.

By "substantially pure DNA" is meant DNA that is free of the genes which, in the naturally-occurring genome of the organism from which the DNA of the invention is derived, flank the gene. The term therefore includes, for example, a recombinant DNA which is incorporated into a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of a prokaryote or eukaryote; or which exists as a separate molecule (e.g., a cDNA or a genomic or cDNA fragment produced by PCR or restriction endonuclease digestion) independent of other sequences. It also includes a recombinant DNA which is part of a hybrid gene encoding additional polypeptide sequence.

By "transformed cell" is meant a cell into which (or into an ancestor of which) has been introduced, by means of recombinant DNA techniques, a DNA molecule encoding (as used herein) a CAR receptor polypeptide.

By "positioned for expression" is meant that the DNA molecule is positioned adjacent to a DNA sequence which directs transcription and translation of the sequence (i.e., facilitates the production of, e.g., CAR receptor polypeptide).

By "a host cell which is functionally deficient for CAR receptor" is meant a cell, e.g., a mammalian cell, which exhibits little or no CAR receptor-mediated gene stimulatory activity; such activity may be measured in standard transactivation assays using, e.g., a transfected reporter gene operably linked to a CAR binding site as described herein.

By "substantially pure antibody" is meant antibody which is at least 60%, by weight, free from the proteins and naturally-occurring organic molecules with which it is naturally associated. Preferably, the preparation is at least 75%, more preferably at least 90%, and most preferably at least 99%, by weight, antibody, e.g., CAR receptor-specific antibody. A substantially pure CAR receptor antibody may be obtained, for example, by affinity chromatography using recombinantly-produced CAR receptor polypeptide and standard techniques.

By "specifically binds", as used herein, is meant an antibody which recognizes and binds CAR receptor polypeptide but which does not substantially recognize and bind other molecules in a sample, e.g., a biological sample, which naturally includes CAR receptor polypeptide.

By "reporter gene" is meant a gene whose expression may be assayed; such genes include, without limitation, chloramphenicol transacetylase (CAT) and β-galactosidase.

By "operably linked" is meant that a gene and a regulatory sequence(s) are connected in such a way as to permit gene expression when the appropriate molecules (e.g., transcriptional activator proteins) are bound to the regulatory sequence(s).

By "heterologous" is meant any protein other than the human CAR receptor which includes a suitable (i.e., a DNA binding, gene activation, and/or ligand binding) domain.

Other features and advantages of the invention will be apparent from the following description of the preferred embodiments thereof, and from the claims.

DETAILED DESCRIPTION

The drawings will first briefly be described.

DRAWINGS

FIG. 1 is the nucleic acid sequence and deduced amino acid sequence of the human CAR receptor (SEQ ID NOS:1-10).

There now follows a description of the cloning and characterization of a human CAR receptor-encoding cDNA useful in the invention, This example is provided for the purpose of illustrating the invention, and should not be construed as limiting.

Isolation of the CAR Receptor

A human liver cDNA library was screened by standard techniques with the following degenerate oligonucleotide probe: TG C/T GAG GGI TG C/T AAG G G/C ITT C/T TT C/T A/C G (SEQ ID NO. 2). This probe was based on the sequence of the P box of the thyroid/retinoid receptor subgroup, the most highly conserved portion of the DNA binding domain. As expected, a number of clones encoding previously described members of the nuclear receptor superfamily were isolated. Based on limited sequence analysis (by standard techniques), one clone that did not correspond to any previously reported cDNA was chosen for further analysis. The complete sequence of this cDNA is presented in FIG. 1 (SEQ ID NO: 1).

As indicated in FIG. 1, this cDNA encodes a protein of 348 amino acids that contains conserved features of the nuclear receptor superfamily in both the DNA binding (C) domain and the putative ligand binding/dimerization (E) domain. Because of the activities of this protein described below, it is called CAR (Constitutive Activator of Retinoic acid response elements). CAR is one of the smallest superfamily members, with the shortest known A/B domain.

As expected from the oligonucleotide used for screening, the sequence of the P box DNA binding specificity determining region of the first zinc finger placed CAR in the thyroid/retinoid group. CAR was not strongly related to any other superfamily member, but was most similar to the vitamin D receptor, sharing 42 identical amino acids out of 66 in the C domain (64%), and 61/152 in the E domain (40%). This is quite similar to the relationship between the thyroid hormone and retinoic acid receptors: TPβ and RARβ which share 60% and 40% sequence identity in the C and E domains, respectively. By comparison, the closely related human TRα and TRβ receptors share 90% and 85% identity in the C and E domain. CAR also shows significant similarity to the Drosophila melanogaster ecdysone receptor (62% in the C domain, 29% in the E domain), making it the third member of a divergent subgroup in the superfamily. Southern blot analysis (by standard techniques) indicated that CAR may not be a member of a closely related subgroup like the TRs and the RARs.

Like a number of other members of the superfamily, CAR has a relatively long 5' untranslated region that contains several AUGs upstream of the start of the open reading frame. The function of this region is unknown.

Northern blot analysis (using standard techniques) indicated that CAR mRNA is expressed in a variety of human tissues, but is most abundant in liver. Multiple RNA species were observed, but the nature of these different species is not clear. Based on the multiple products expressed by genes encoding other superfamily members, it may well reflect alternative splicing or promoter utilization. CAR mRNA is also observed in a number of human cell lines, including HepG2 (hepatoma), JEG-3 (choriocarcinoma), and HeLa.

Ligand-independent transcriptional activation by the CAR ligand/dimerization domain

To study the function of the putative ligand binding domain of the orphan, a chimeric receptor consisting of the C-terminal D, E and F domains of CAR (i.e., approximately amino acids 76-172, 173-319, and 320-348 of the CAR sequence, respectively) fused to the N-terminal A/B and C domains of TRβ was generated (termed TR/CAR), along with a control hybrid with the A/B and C domains of TR and the D, E and F domain of the glucocorticoid receptor (TR/GR). Vectors expressing these chimeras or the intact TR were cotransfected into JEG3 cells (ATCC Accession No: HTB 36) with reporter plasmids containing T3REs (i.e., of sequence AAAGGTAAGATCAGGGACGTGACCGCAG; SEQ ID NO: 3) or GREs (from the MMTV promoter as described in Chandler et al., Cell 33:489, 1989) upstream of a reporter gene whose expression could be assayed. Transfections were carried out in the presence of serum treated with activated charcoal to remove T3 as well as other low molecular weight hydrophobic compounds that could act as CAR ligands.

Results from such an analysis revealed that both TR and the TR/GR chimera behaved as ligand-dependent transactivators, as expected. The relatively low but reproducible level of activation conferred by TR/GR is consistent with a previous report, and is thought to be a consequence of the absence from this hybrid of transcriptional activation domains present in the A/B domain of the intact GR and the F domain of the intact TR.

In contrast, the TR/CAR chimera activated expression of the T3RE containing reporter in the absence of any added ligand. This effect was not altered by addition of T3, estradiol, testosterone, retinoic acid, or dexamethasone, or by the addition of the orphan receptor ligand 25-hydroxy cholesterol. The constitutive activation conferred by this hybrid was substantial, corresponding to greater than 50% of the response of the intact TRβ. Similar results were observed with an analogous GR/CAR chimera in cotransfections with an MMTV/CAT reporter.

The activation conferred by the TR/CAR chimera in the presence of charcoal-stripped serum could be a consequence of either a direct, constitutive transcriptional activation function in the D, E or F domains of the orphan, or the presence of an uncharacterized ligand in the medium of the growing cells. To minimize the possibility of the presence of such a ligand in the media, the transfections were repeated in the absence of serum. Such conditions substantially reduced the level of both control and T3-activated expression, but did not prevent the constitutive activation function conferred by the TR/CAR chimera. Because of the substantial changes in the levels of expression from control promoters, the significance of the apparent decrease in constitutive activity relative to the level of activation conferred by TPβ in the presence of hormone is uncertain. This apparent decrease may reflect the absence of some specific stimulator of CAR function present in serum or could be a less specific effect associated with the unfavorable growth conditions.

Activation of retinoic acid response elements by CAR

Based on the constitutive activity of the TR/CAR chimera and the similarity of the DNA binding domain of CAR to other members of the superfamily, a number of response elements were screened for activation by the intact orphan in standard cotransfections. Despite the relatively close sequence relationship between CAR and the vitamin D receptor, no response was seen with the combined vitamin D/RA response element from the rat osteocalcin gene (i.e., of sequence TGGGTGAATGAGGACATTACTGACCGCTCCG; SEQ ID NO: 4). However, CAR did transactivate two wild type elements, the RAREs from the RARβ gene and the alcohol dehydrogenase (ADH) gene (i.e., GGGTAGGGTTCACCGAAAGTTCACTCG; SEQ ID NO: 5). RAREs which were not transactivated by CAR included a potent up mutant version of the rat growth hormone gene T3RE/RARE (i.e., of sequence AAAGGTAAGATCAGGGACGTGACCTCAG; SEQ ID NO:6; in tandem copies with the second copy inverted; described in Brent et al., Mol. Endocrinol. 3:1996, 1989) and the wild type version of the laminin T3RE/RARE (i.e., of sequence AGACAGGTTGACCCTTTTTCTAAGGGCTTAACCTAGCTCACCTG; SEQ ID NO: 8). The rat malic enzyme T3RE (i.e., of sequence AGGACGTTGGGGTTAGGGGAGGACAGTG; SEQ ID NO: 9) and the rat α-myosin heavy chain T3RE (i.e., of sequence CTGGAGGTGACAGGAGGACAGCAGCCCTGA; SEQ ID NO: 7), which do not respond to RARs, did not respond to CAR.

The activation of the RARβ element by CAR is not affected by several treatments that activate or inactivate signal transduction pathways mediated by protein kinases A or C. Thus, CAR activation was not altered by addition of dibutyryl cyclic AMP, by short or long term treatments with phorbol esters, or by cotransfections with vectors expressing the specific protein kinase A inhibitor peptide.

The activation of the RAREs by CAR suggests that this orphan could play an important role in the complex regulatory network that controls expression of RA responsive genes. To examine the functional interactions between CAR and RARβ, expression vectors for both were mixed in cotransfections. Increasing amounts of the CAR expression vector were added to a βRARE (SEQ ID NO: 5)-containing CAT reporter plasmid and a fixed amount of RARβ expression vector corresponding to approximately 2/3 of the level that is saturating for induction. In the absence of RA, the increasing amount of CAR led to an increase in basal expression. In the presence of RA, the high level of activated expression was not strongly affected by addition of CAR. Together, these effects resulted in a significant decrease in the RA induction ratio with increasing amounts of CAR.

These results indicate that the two superfamily members function independently at this response element. Analogous results were obtained when increasing amounts of RARβ were added to a subsaturating amount of CAR expression vector. In the absence of RA, the constitutive activation conferred by CAR could be blocked by excess RARβ. This is consistent with previous reports that RARs can repress expression in the absence of ligand. When retinoic acid was added, activation was observed at even moderate levels of RARβ. The levels of activated expression associated with the various doses of RARβ vector were similar to those observed in the absence of cotransfected CAR. It therefore seems most likely that CAR and RARβ do not interfere with each other when co-expressed at moderate levels. Under other circumstances, more complex indirect effects could be anticipated for the interaction between CAR and RXR (see below).

In contrast, addition of low levels of RXRα stimulated the effect of a subsaturating dose of CAR expression vector. Since similar results are observed with the receptors able to heterodimerize with RXRs, this result strongly suggests that RXRe may share a similar interaction with CAR.

DNA binding by CAR

To confirm that CAR binds the βRARE, nuclear extracts from HeLa cells infected with vaccinia virus vectors overexpressing FLAG® epitope-tagged versions of CAR or RARα were used in standard gel shift experiments. Both CAR- and RARα-containing extracts showed specific binding to the RARβ element (SEQ ID NO: 5), with RAR binding being of higher apparent affinity. In agreement with results of cotransfections, CAR binding was strongly stimulated by addition of nuclear extract from vaccinia infected HeLa cells overexpressing RXRα. However, little or no effect on affinity was observed when CAR and RAR were mixed. No evidence for formation of CAR/RAR heterodimers was observed in gels electrophoresed for longer times to better resolve the CAR and RAR shifted complexes. From these results, it appears that CAR can bind directly to the RARβ element, and that binding is strongly stimulated by RXRα.

The constitutive activity of CAR may be due to either a truly constitutive transcriptional activation function or the presence of some ubiquitous ligand. The former possibility is supported by its activity in serum free media and in distinct cell types. The existence of a ligand may be favored by the somewhat lower relative activity observed with serum free medium compared to medium containing charcoal stripped serum. However, the substantial changes in the expression of the control and activated promoters under these two quite distinct growth conditions suggest that less specific or direct effects could explain this difference.

Negative results have been obtained by several approaches designed to determine whether the constitutive activation of CAR is associated with various second messenger pathways. Evidence obtained to date indicates that CAR function is not affected by activation or repression of the activity of protein kinases A or C.

Based on the results presented here, it is possible that CAR plays two important roles in the complex, interlocking set of proteins that determines responses to RA, T3 and vitamin D. The first is to maintain a basal level of expression of a subset of RA responsive genes in the absence of the ligand. In the case of a cell expressing only RARβ, for example, this could allow expression of sufficient levels of the receptor to allow autoactivation of the RA-dependent positive feedback loop that regulates RARβ expression upon addition of ligand. The second potential function, is based on the interaction of CAR with RXR. Increasing expression of CAR would be expected to decrease the amount of RXR available for interaction with other heterodimeric partners. Thus, in a cell with limiting amounts of RXR, alterations in the amount or activity of CAR protein could have significant effects on the activity of RARs, T3Rs or VDR. Although the levels of CAR used in the cotransfections reported here did not show an antagonistic effect on RAR activity, preliminary results indicate that inhibitory effects of this type can be observed in other circumstances. Given the remarkable complexity of the regulatory networks that control response to the retinoids and the other ligands of this subgroup of the nuclear receptor superfamily, it is likely that even more complicated functions will be found for CAR.

Polypeptide Expression

Polypeptides according to the invention may be produced by transformation of a suitable host cell with all or part of a CAR receptor-encoding cDNA fragment (e.g., the cDNA described above) in a suitable expression vehicle.

Those skilled in the field of molecular biology will understand that any of a wide variety of expression systems may be used to provide the recombinant receptor protein. The precise host cell used is not critical to the invention. The CAR receptor may be produced in a prokaryotic host (e.g., E. coli) or in a eukaryotic host (e.g., Saccharomyces cerevisiae or mammalian cells, e.g., COS 1, NIH 3T3, and JEG3 cells). Such cells are available from a wide range of sources (e.g., the American Type Culture Collection, Rockland, Md.; also, see, e.g., Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989). The method of transfection and the choice of expression vehicle will depend on the host system selected. Transformation and transfection methods are described, e.g., in Ausubel et al. (Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989); expression vehicles may be chosen from those provided, e.g., in Cloning Vectors: A Laboratory Manual (P. H. Pouwels et al., 1985, Supp. 1987).

One preferred expression system is the mouse 3T3 fibroblast host cell transfected with a pMAMneo expression vector (Clontech, Palo Alto, Calif.). pMAMneo provides: an RSV-LTR enhancer linked to a dexamethasone-inducible MMTV-LTR promotor, an SV40 origin of replication which allows replication in mammalian systems, a selectable neomycin gene, and SV40 splicing and polyadenylation sites. DNA encoding a CAR receptor polypeptide would be inserted into the pMAMneo vector in an orientation designed to allow expression. The recombinant receptor protein would be isolated as described below. Other preferable host cells which may be used in conjunction with the pMAMneo expression vehicle include COS cells and CHO cells (ATCC Accession Nos. CRL 1650 and CCL 61, respectively).

Alternatively, a CAR receptor polypeptide is produced by a stably-transfected mammalian cell line.

A number of vectors suitable for stable transfection of mammalian cells are available to the public, e.g., see Pouwels et al. (supra); methods for constructing such cell lines are also publicly available, e.g., in Ausubel et al. (supra). In one example, cDNA encoding the receptor polypeptide is cloned into an expression vector which includes the dihydrofolate reductase (DHFR) gene. Integration of the plasmid and, therefore, the CAR receptor-encoding gene into the host cell chromosome is selected for by inclusion of 0.01-300 μM methotrexate in the cell culture medium (as described in Ausubel et al., supra). This dominant selection can be accomplished in most cell types. Recombinant protein expression can be increased by DHFR-mediated amplification of the transfected gene. Methods for selecting cell lines bearing gene amplifications are described in Ausubel et al. (supra); such methods generally involve extended culture in medium containing gradually increasing levels of methotrexate. DHFR-containing expression vectors commonly used for this purpose include pCVSEII-DHRF and pAdD26SV(A) (described in Ausubel et al., supra). Any of the host cells described above or, preferably, a DHFR-deficient CHO cell line (e.g., CHO DHFR⁻ cells, ATCC Accession No. CRL 9096) are among the host cells preferred for DHFR selection of a stably-transfected cell line or DHFR-mediated gene amplification.

Once the recombinant CAR receptor polypeptide is expressed, it is isolated, e.g., using affinity chromatography. In one example, a CAR binding site (e.g., the βRARE site described above) or an anti-CAR receptor antibody (e.g., produced as described below) may be attached to a column and used to isolate the receptor polypeptide. Lysis and fractionation of receptor-harboring cells prior to affinity chromatography may be performed by standard methods (see, e.g., Ausubel et al., supra). Once isolated, the recombinant protein can, if desired, be further purified, e.g., by high performance liquid chromatography (see, e.g., Fisher, Laboratory Techniques In Biochemistry And Molecular Biology, eds., Work and Burdon, Elsevier, 1980).

Receptors of the invention, particularly short receptor fragments, can also be produced by chemical synthesis (e.g., by the methods described in Solid Phase Peptide Synthesis, 2nd ed., 1984 The Pierce Chemical Co., Rockford, Ill.).

These general techniques of polypeptide expression and purification can also be used to produce and isolate useful CAR receptor fragments or analogs (described below).

Identification of Ligands which Bind CAR Receptors

Although the CAR receptor described above was capable of activating some level of target gene expression in the apparent absence of a ligand, this does not discount the possibility that the receptor interacts with one or more ligands, e.g., to modulate receptor activity. Moreover, it is possible that multiple CAR receptors exist (e.g., as the products of a differentially-spliced CAR mRNA) and that different receptor species interact with different ligands. Accordingly, one aspect of the invention features a screening assay for the identification of compounds which specifically bind to the CAR receptors described herein. Such an assay may be carried out using a recombinant receptor protein.

In one example, the CAR receptor component is produced by a cell that naturally produces substantially no receptor or by a cell which produces functionally deficient receptor (i.e., a cell which apparently expresses CAR receptor mRNA as measured by Northern blotting but which does exhibit reporter gene induction, in the absence of recombinantly-produced CAR receptor, in a transactivation assay, see below); suitable cells are, e.g., those discussed above with respect to the production of recombinant receptor, most preferably, mammalian cells such as JEG3 cells. Host cells are transfected with (1) a vector which expresses a nucleic acid encoding the CAR receptor component (i.e., the "producer vector") and (2) a vector which includes a CAR receptor binding site (e.g., the βRARE sequence GGGTAGGGTTCACCGAAAGTTCACTCG; SEQ ID NO: 5; described above) positioned upstream of a target gene which may be assayed (e.g., a CAT gene or a β-galactosidase gene) (i.e., the "reporter vector"). Using such a standard transactivation assay procedure, CAR receptor activity is assayed by measuring CAR binding site-dependent target gene expression. CAR ligands are identified as those compounds which, when added to the host cell medium, effect a change in CAR receptor-directed gene expression (as detected using any CAR reporter vector); a CAR ligand according to the invention may either increase CAR receptor activity or decrease CAR receptor activity.

Any suitable transactivation technique, CAR receptor-encoding producer vector, and CAR receptor binding site-containing reporter vector may be used. Descriptions of transactivation assays and generally useful vectors for the identification of ligands which bind other nuclear hormone receptors are described, e.g., in Evans et al. (U.S. Pat. No. 4,981,784, 1991); Evans et al. (WO 90/07517); Evans et al. (W090/01428); and WO88/03168; all hereby incorporated by reference. CAR receptor polypeptides which may be used to screen for CAR ligands include wild-type molecules as well as any appropriate chimeric receptor, for example, the GR/CAR and TR/CAR receptors described above.

Candidate ligands may be purified (or substantially purified) molecules or the ligand may be one component of a mixture of ligands (e.g., an extract or supernatant obtained from cells; Ausubel et al., supra). In a mixed ligand assay, the CAR ligand is identified by testing progressively smaller subsets of the ligand pool (e.g., produced by standard purification techniques, e.g., HPLC or FPLC) until a single ligand is finally demonstrated to modulate the CAR receptor gene stimulatory activity. Candidate CAR ligands include peptide as well as non-peptide molecules.

Alternatively, a ligand may be identified by its ability to bind a CAR receptor polypeptide using affinity chromatography. Recombinant receptor is purified by standard techniques, from cells engineered to express the receptor (e.g., those described above); the recombinant receptor immobilized on a column (e.g., a Sepharose column or a streptavidin-agarose column by the immunoaffinity method of Ausubel et al., supra) and a solution containing one or more candidate ligands is passed through the column. Such a solution (i.e., such a source of candidate ligands) may be, e.g., a cell extract, mammalian serum, or growth medium on which mammalian cells have been cultured and into which the cells have secreted factors (e.g., growth factors) during culture; again, candidate CAR ligands include peptide as well as non-peptide molecules. A ligand specific for a recombinant receptor is immobilized on the column (because of its interaction with the receptor). To isolate the ligand, the column is first washed to remove non-specifically bound molecules, and the ligand of interest is then released from the column and collected.

CAR ligands isolated by the above methods (or any other appropriate method) may, if desired, be further purified (e.g., by high performance liquid chromatography; see above). Once isolated in sufficiently-purified form, a novel peptide ligand may be partially sequenced (by standard amino acid sequencing techniques). From this partial amino acid sequence, a partial nucleic acid sequence is deduced which allows the preparation of primers for PCR cloning of the ligand gene (e.g., by the method of Ausubel et al., supra).

Identification of CAR Receptor DNA Binding Sites

Identification of the CAR receptor facilitates identification of its DNA binding site(s). According to one approach, CAR receptor DNA binding sites may be identified using a transactivation assay, e.g., as described above for the identification of the binding site of sequence GGGTAGGGTTCACCGAAAGTTCACTCG (SEQ ID NO: 5). Briefly, candidate DNA binding sites are inserted upstream of a target gene (whose expression may be assayed, e.g., those genes described above) and the ability of a CAR receptor polypeptide to bind the DNA site is assayed as its ability to activate downstream gene expression.

Alternatively, a DNA binding site may be identified by selectively retaining a receptor-bound DNA fragment on a nitrocellulose filter. This approach relies on the ability of nitrocellulose to bind proteins but not double-stranded DNA. Purified CAR receptor polypeptide (e.g., purified by standard techniques from cells engineered to express the receptor, e.g., those described above) is mixed with labelled double-stranded DNA (e.g., a random pool of DNA fragments) under conditions which allow interaction. After incubation, the mixture is suction filtered through nitrocellulose, allowing unbound DNA to pass through the filter while retaining the protein and any DNA specifically bound to it. Bound DNA fragments are then eluted from the filter and analyzed by gel electrophoresis or amplification and cloning. A detailed description of this technique is published in Ausubel et al., supra).

Candidate DNA fragments for either approach may be derived from a randomly cleaved or sonicated genomic DNA library and/or may be derived from known nuclear hormone response elements (see, e.g., Evans et al., W090/11273).

Identification of CAR receptor DNA binding sites facilitates a search for the presence of such sites upstream of known or yet unidentified genes (e.g., by an examination of sequences upstream of known genes or by standard hybridization screening of a genomic library with binding site probes). CAR-mediated transcriptional control of genes bearing the binding site upstream may then be investigated (e.g., by transactivation experiments as described above), potentially leading to the elucidation of novel CAR receptor functions.

Chimetic Receptors

The functional domains of the CAR receptor may be swapped with the domains of other members of the nuclear hormone receptor family (see, e.g., Evans et al., WO 90/11273; Evans, Science 240:889, 1988) in order to produce receptors having novel properties. For example, if the DNA binding domain of the glucocorticoid receptor were fused to the gene activation domain of the CAR receptor, a novel receptor would be produced which could bind genes bearing an upstream glucocorticoid response element and activate gene expression in the absence of hormone. Conversely, fusion of the CAR DNA binding domain to the ligand-binding and gene activation domains of glucocorticoid receptor would confer hormonal regulation on genes downstream of CAR binding sites. Finally, fusion of the CAR DNA binding domain to a trans-repressing domain (see, e.g., Evans et al., WO90/14356) would result in repression of the basal level of expression of genes bearing upstream CAR binding sites. Construction of receptor fusion genes is carried out by standard techniques of molecular biology. CAR receptor domains are as follows: DNA binding domain, approximately amino acids 11-76; and gene activation and potential ligand binding domain, approximately amino acids 76-348. Examples of receptor domains which may be included in a chimeric CAR receptor are described in Evans et al. (WO 90/15815) and in Evans et al. (Science 240:889, 1988).

Dominant Negative Mutants

Mutants of the CAR receptor may be generated which interfere with normal CAR receptor activity. Such mutants are termed "dominant negative" and fall into at least two classes: (a) ones which bind to their DNA binding site (thereby interfering with the ability of wild-type receptor to bind the same site) and which do not activate gene expression and (b) ones which heterodimerize with other receptors (e.g., RXR) but which do not promote the biological response associated with the wild-type heterodimer.

The first class of CAR dominant negative mutants include those receptor polypeptides which contain a wild-type DNA binding domain and a mutant gene activation domain. Such mutants are unable to transactivate a reporter gene (e.g., as measured using a CAT reporter gene with an upstream βRARE and the standard methods described above) but retain the ability to bind a CAR DNA binding site (as evidenced, e.g., by DNA footprint analysis using a βRARE DNA sequence; Ausubel et al., supra).

The second class of CAR dominant negative mutants include those receptor polypeptides which contain a wild-type heterodimerization domain. Such a mutant interacts with its heterodimer partner and disrupts the partner's function. In one particular example, a dominant negative CAR receptor polypeptide may be overproduced (e.g., by directing its expression from a very strong promoter); the abundant CAR receptor polypeptide forms heterodimers with cellular RXR protein, soaking up available RXR and thereby preventing RXR homodimer formation as well as RXR heterodimer formation with other partner proteins (e.g., RAR, VDR, and T3R). Wild-type CAR receptor polypeptide may function as a dominant negative mutant if overproduced in this manner. However, a mutant CAR receptor lacking a gene activation domain (e.g., as identified above) and/or a DNA binding domain (e.g., as identified by DNA footprint analysis, above) is preferred.

Any of the above mutants may be generated by any method of random or site-directed DNA mutagenesis (see, e.g., Ausubel et al., supra).

Identification of Molecules which Modulate CAR Receptor Expression

Isolation of the CAR receptor gene also facilitates the identification of molecules which increase or decrease CAR receptor expression, and which may be useful as therapeutics, e.g., for treatment of cancers such as lung cancer, or for treatment of thyroid disorders such as Graves' disease. According to one approach, candidate molecules (e.g., peptide or non-peptide molecules found, e.g., in a cell extract, mammalian serum, or growth medium on which mammalian cells have been cultured) are added at varying concentrations to the culture medium of cells which express CAR receptor mRNA (e.g., HepG2, JEG-3, or HeLa cells). CAR receptor expression is then measured by standard Northern blot analysis (Ausubel et al., supra) using CAR receptor cDNA as a hybridization probe. The level of CAR receptor expression in the presence of the candidate molecule is compared to the level measured for the same cells in the same culture medium but in the absence of the candidate molecule. A molecule which promotes an increase or decrease in CAR receptor expression is considered useful in the invention.

Anti-CAR Receptor Antibodies

Human CAR receptor (or immunogenic receptor fragments or analogues) may be used to raise antibodies useful in the invention; such polypeptides may be produced by recombinant or peptide synthetic techniques (see, e.g., Solid Phase Peptide Synthesis, supra; Ausubel et al., supra). The peptides may be coupled to a carrier protein, such as KLH as described in Ausubel et al, supra. The KLH-peptide is mixed with Freund's adjuvant and injected into guinea pigs, rats, or preferably rabbits. Antibodies may be purified by peptide antigen affinity chromatography.

Monoclonal antibodies may be prepared using the CAR polypeptides described above and standard hybridoma technology (see, e.g., Kohler et al., Nature 256:495, 1975; Kohler et al., Eur. J. Immunol. 6:511, 1976;. Kohler et al., Eur. J. Immunol. 6:292, 1976; Hammerling et al., In Monoclonal Antibodies and T Cell Hybridomas, Elsevier, NY, 1981; Ausubel et al., supra).

Once produced, polyclonal or monoclonal antibodies are tested for specific CAR receptor recognition by Western blot or immunoprecipitation analysis (by the methods described in Ausubel et al., supra). Antibodies which specifically recognize a CAR receptor polypeptide are considered to be useful in the invention; such antibodies may be used, e.g., in an immunoassay to monitor the level of CAR receptor produced by a mammal.

Therapy

Because a lack of retinoic acid receptor has been associated with the occurrence of lung cancer and because the CAR receptor polypeptide binds and activates expression of the retinoic acid receptor gene, it is likely that the administration of a CAR receptor polypeptide to a mammal may prevent or treat cancers, particularly lung cancer. Similar therapeutic results would be expected for administration of a ligand which stimulates CAR receptor activity.

CAR receptor polypeptides may also find therapeutic use in the treatment of Graves disease, a disease resulting from an increase in thyroid hormone receptor function. RXR protein plays a role in thyroid hormone receptor expression. Accordingly, dominant negative CAR mutants which heterodimerize with RXR protein (including overexpressed wild-type CAR receptor protein) may act to decrease the cellular levels of available RXR and thereby decrease thyroid hormone receptor function. Again, ligands which increase heterodimerization efficiency could also be administered as a treatment for Graves' disease.

To treat the above diseases, the appropriate CAR receptor polypeptide (or ligand) is administered as a therapeutic preparation (e.g., in physiological saline) in accordance with the condition to be treated. Ordinarily, it will be administered intravenously, at a dosage effective to increase retinoic acid receptor expression (as a cancer treatment) or effective to decrease thyroid hormone receptor function (as a treatment for Graves' disease). Alternatively, it may be convenient to administer the therapeutic orally, nasally, or topically, e.g., as a liquid or a spray. Again, the dosages are as described above. Treatment may be repeated as necessary for alleviation of disease symptoms.

The methods of the invention may be used to reduce the disorders described herein in any mammal, for example, humans, domestic pets, or livestock. Where a non-human mammal is treated, the CAR receptor polypeptide or the antibody employed is preferably specific for that species.

Other Embodiments

Polypeptides according to the invention include the entire human CAR receptor (as described in FIG. 1; SEQ ID NO:10) as well as any analog or fragment of the human CAR receptor which includes either a DNA binding domain and a gene activation domain; or which includes a heterodimerization domain (as identified using the techniques described above).

Polypeptides of the invention also include all mRNA processing variants (e.g., all products of alternative splicing or differential promoter utilization) as well as CAR receptor proteins from other mammals.

Specific receptor fragments or analogues of interest include full-length or partial (see below) receptor proteins including an amino acid sequence which differs only by conservative amino acid substitutions, for example, substitution of one amino acid for another of the same class (e.g., valine for glycine, arginine for lysine, etc.) or by one or more non-conservative amino acid substitutions, deletions, or insertions located at positions of the amino acid sequence which do not destroy the receptors' ability to either bind DNA and activate transcription; or to interact with CAR receptor's heterodimerization partners (as assayed above). Analogs also include receptor polypeptides which are modified for the purpose of increasing peptide stability; such analogs may contain, e.g., one or more desaturated peptide bonds or D-amino acids in the peptide sequence or the peptide may be formulated as a cyclized peptide molecule.

Other embodiments are within the following claims.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 10                                                  (2) INFORMATION FOR SEQ ID NO: 1:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1450                                                               (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:                                       GTGAGCTTGCTCCTTAAGTTACAGGAACTCTCCTTATAATAGACACTTCATTTTCCTAGT60                 CCATCCCTCATGAAAAATGACTGACCACTGCTGGGCAGCAGGAGGGATGATAATCCTAAC120                TCCAATCACTGGCAACTCCTGAGATCAGAGGAAAACCAGCAACAGCGTGGGAGTTTGGGG180                AGAGGCATTCCATACCAGATTCTGTGGCCTGCAGGTGACATGCTGCCTAAGAGAAGCAGG240                AGTCTGTGACAGCCACCCCAACACGTGACGTCATGGCCAGTAGGGAAGATGAGCTGAGGA300                ACTGTGTGGTATGTGGGGACCAAGCCACAGGCTACCACTTTAATGCGCTGACTTGTGAGG360                GCTGCAAGGGTTTCTTCAGGAGAACAGTCAGCAAAAGCATTGGTCCCACCTGCCCCTTTG420                CTGGAAGCTGTGAAGTCAGCAAGACTCAGAGGCGCCACTGCCCAGCCTGCAGGTTGCAGA480                AGTGCTTAGATGCTGGCATGAGGAAAGACATGATACTGTCGGCAGAAGCCCTGGCATTGC540                GGCGAGCAAAGCAGGCCCAGCGGCGGGCACAGCAAACACCTGTGCAACTGAGTAAGGAGC600                AAGAAGAGCTGATCCGGACACTCCTGGGGGCCCACACCCGCCACATGGGCACCATGTTTG660                AACAGTTTGTGCAGTTTAGGCCTCCAGCTCATCTGTTCATCCATCACCAGCCCTTGCCCA720                CCCTGGCCCCTGTGCTGCCTCTGGTCACACACTTCGCAGACATCAACACTTTCATGGTAC780                TGCAAGTCATCAAGTTTACTAAGGACCTGCCCGTCTTCCGTTCCCTGCCCATTGAAGACC840                AGATCTCCCTTCTCAAGGGAGCAGCTGTGGAAATCTGTCACATCGTACTCAATACCACTT900                TCTGTCTCCAAACACAAAACTTCCTCTGCGGGCCTCTTCGCTACACAATTGAAGATGGAG960                CCCGTGTGGGGTTCCAGGTAGAGTTTTTGGAGTTGCTCTTTCACTTCCATGGAACACTAC1020               GAAAACTGCAGCTCCAAGAGCCTGAGTATGTGCTCTTGGCTGCCATGGCCCTGTTCTCTC1080               CTGACCGACCTGGAGTTACCCAGAGAGATGAGATTGATCAGCTGCAAGAGGAGATGGCAC1140               TGACTCTGCAAAGCTACATCAAGGGCCAGCAGCGAAGGCCCCGGGATCGGTTTCTGTATG1200               CGAAGTTGCTAGGCCTGCTGGCTGAGCTCCGGAGCATTAATGAGGCCTACGGGTACCAAA1260               TCCAGCACATCCAGGGCCTGTCTGCCATGATGCCGCTGCTCCAGGAGATCTGCAGCTGAG1320               GCCATGCTCACTTCCTTCCCCAGCTCACCTGGAACACCCTGGATACACTGGAGTGGGAAA1380               ATGCTGGGACCAAAGATTGGGCCGGGTTCAAAGGGAGCCCAGTGGTTGCAATGAAAGACT1440               AAAGCAAAAC1450                                                                 (2) INFORMATION FOR SEQ ID NO: 2:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 26                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:                                       TGYGAGGGNTGYAAGGSNTTYTTYMG26                                                   (2) INFORMATION FOR SEQ ID NO: 3:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 28                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:                                       AAAGGTAAGATCAGGGACGTGACCGCAG28                                                 (2) INFORMATION FOR SEQ ID NO: 4:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 31                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:                                       TGGGTGAATGAGGACATTACTGACCGCTCCG31                                              (2) INFORMATION FOR SEQ ID NO: 5:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 27                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:                                       GGGTAGGGTTCACCGAAAGTTCACTCG27                                                  (2) INFORMATION FOR SEQ ID NO: 6:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 28                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:                                       AAAGGTAAGATCAGGGACGTGACCTCAG28                                                 (2) INFORMATION FOR SEQ ID NO: 7:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:                                       CTGGAGGTGACAGGAGGACAGCAGCCCTGA30                                               (2) INFORMATION FOR SEQ ID NO: 8:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 44                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:                                       AGACAGGTTGACCCTTTTTCTAAGGGCTTAACCTAGCTCACCTG44                                 (2) INFORMATION FOR SEQ ID NO: 9:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 28                                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:                                       AGGACGTTGGGGTTAGGGGAGGACAGTG28                                                 (2) INFORMATION FOR SEQ ID NO: 10:                                             (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 348                                                                (B) TYPE: amino acid                                                           (C) STRANDEDNESS: Not Relevant                                                 (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:                                      MetAlaSerArgGluAspGluLeuArgAsnCysValValCysGlyAsp                               151015                                                                         GlnAlaThrGlyTyrHisPheAsnAlaLeuThrCysGluGlyCysLys                               202530                                                                         GlyPhePheArgArgThrValSerLysSerIleGlyProThrCysPro                               354045                                                                         PheAlaGlySerCysGluValSerLysThrGlnArgArgHisCysPro                               505560                                                                         AlaCysArgLeuGlnLysCysLeuAspAlaGlyMetArgLysAspMet                               65707580                                                                       IleLeuSerAlaGluAlaLeuAlaLeuArgArgAlaLysGlnAlaGln                               859095                                                                         ArgArgAlaGlnGlnThrProValGlnLeuSerLysGluGlnGluGlu                               100105110                                                                      LeuIleArgThrLeuLeuGlyAlaHisThrArgHisMetGlyThrMet                               115120125                                                                      PheGluGlnPheValGlnPheArgProProAlaHisLeuPheIleHis                               130135140                                                                      HisGlnProLeuProThrLeuAlaProValLeuProLeuValThrHis                               145150155160                                                                   PheAlaAspIleAsnThrPheMetValLeuGlnValIleLysPheThr                               165170175                                                                      LysAspLeuProValPheArgSerLeuProIleGluAspGlnIleSer                               180185190                                                                      LeuLeuLysGlyAlaAlaValGluIleCysHisIleValLeuAsnThr                               195200205                                                                      ThrPheCysLeuGlnThrGlnAsnPheLeuCysGlyProLeuArgTyr                               210215220                                                                      ThrIleGluAspGlyAlaArgValGlyPheGlnValGluPheLeuGlu                               225230235240                                                                   LeuLeuPheHisPheHisGlyThrLeuArgLysLeuGlnLeuGlnGlu                               245250255                                                                      ProGluTyrValLeuLeuAlaAlaMetAlaLeuPheSerProAspArg                               260265270                                                                      ProGlyValThrGlnArgAspGluIleAspGlnLeuGlnGluGluMet                               275280285                                                                      AlaLeuThrLeuGlnSerTyrIleLysGlyGlnGlnArgArgProArg                               290295300                                                                      AspArgPheLeuTyrAlaLysLeuLeuGlyLeuLeuAlaGluLeuArg                               305310315320                                                                   SerIleAsnGluAlaTyrGlyTyrGlnIleGlnHisIleGlnGlyLeu                               325330335                                                                      SerAlaMetMetProLeuLeuGlnGluIleCysSer                                           340345                                                                         __________________________________________________________________________ 

We claim:
 1. Substantially pure DNA which encodes a polypeptide comprising the amino acid sequence of SEQ ID NO: 10 or a sequence which differs therefrom by one or more conservative amino acid substitutions, wherein said polypeptide comprises identically the DNA binding domain (amino acids 11-75) and the gene activation domain (amino acids 76-348) of SEQ ID NO: 10 and wherein said polypeptide is capable of binding to the DNA sequence of SEQ ID NO: 5 and activating expression of a gene operably linked thereto.
 2. Substantially pure DNA which encodes a polypeptide comprising:(i) the DNA binding domain (amino acids 11-75) of the CAR receptor sequence shown in SEQ ID NO: 10, or a sequence which differs therefrom by a single conservative amino acid substitution; and (ii) the gene activating domain (amino acids 76-348) of the CAR receptor sequence shown in SEQ ID NO: 10, or a sequence which differs therefrom by a single conservative amino acid substitution; wherein said polypeptide is capable of binding to the DNA sequence of SEQ ID NO: 5 and activating the expression of a gene operably linked thereto.
 3. Substantially pure DNA which encodes a polypeptide comprising a CAR receptor DNA binding domain which comprises a sequence identical to amino acids 11-75 of FIG. 1 (SEQ ID NO: 10) or which differs therefrom by a single conservative amino acid substitution, wherein said polypeptide is capable of binding to the DNA sequence of SEQ ID NO:
 5. 4. Substantially pure DNA which encodes a polypeptide comprising a CAR receptor heterodimerization domain which comprises a sequence identical to amino acids 173-319 of FIG. 1 (SEQ ID NO: 10).
 5. The substantially pure DNA of claim 1 or 2, wherein said DNA is cDNA.
 6. The substantially pure DNA of claim 1 or 2, wherein said DNA encodes a mammalian CAR receptor.
 7. The substantially pure DNA of claim 6, wherein said DNA encodes a human CAR receptor.
 8. Substantially pure DNA which encodes a polypeptide comprising the sequence of FIG. 1 (SEQ ID NO: 10).
 9. A vector comprising the substantially pure DNA of claim 1, 2, 3 or 4, said vector being capable of directing expression of the protein encoded by said DNA in a vector-containing cell.
 10. A cell which contains the substantially pure DNA of claim
 9. 11. The cell of claim 10, said cell being a eukaryotic cell.
 12. The cell of claim 11, said cell being a mammalian cell.
 13. A method of producing a recombinant CAR receptor polypeptide comprising:(a) providing a cell transformed with DNA encoding a CAR receptor of claim 1 or 2 positioned for expression in said cell; (b) culturing said transformed cell under conditions for expressing said DNA; and (c) isolating said recombinant CAR receptor polypeptide. 